Amendments to the Claims: 

1. (Currently Amended) A system for interactive language instruction 
comprising: 

a first module configured to convert input text to audible speech in a selected 
language, the audible speech being patterned after a model; 

a user interface configured to receive utterances spoken by a user in response to a 
prompt to replicate the audible speech; and, 

a second module configured to recognize the utterances and provide feedback to the 
user, the feedback being comprised of a confidence measure reflecting a precision at which 
the user replicates the audible speech in the selected language based on a comparison of the 
utterances to one of the audible speech and the modeL wherein the confidence measure is 
provided as scores for replication of at least one of paragraphs, sentences, words and sub- 
words . 

2. (Previously Amended) The system as set forth in claim 1 further 
comprising a third module synchronized to the first module for producing an animated image 
of a human face and head pronouncing the audible speech. 

3. (Original) The system as set forth in claim 2 wherein the animated image 
of the human face and head portrays a transparent face and head. 

4. (Original) The system as set forth in claim 2 wherein the first and third 
modules further include controls to control one of the volume, speed, and vocal 
characteristics of the video image and the audible speech. 
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5. (Original) The system as set forth in claim 1 wherein the model is one of a 
predictive model, phoneme model, a diphone model, and a dynamically generated model. 

6. (Original) The system as set forth in claim 1 wherein the first module 
includes files storing model pronunciations for words comprising the input text. 

7. (Original) The system as set forth in claim 1 further comprising lesson 
files wherein the input text is based on data stored in the lesson files. 

8. (Original) The system as set forth in claim 1 wherein the input text is 
based on data received from a source outside of the system. 

9. (Original) The system as set forth in claim 1 wherein the system further 
includes dictionary files. 

10. (Original) The system as set forth in claim 1 wherein the system further 
comprises a record and playback module. 

1 1 . (Original) The system as set forth in claim 1 wherein the system further 
includes a table storing mapping information between word subgroups and vocabulary words. 

12. (Original) The system as set forth in claim 1 wherein the system further 
includes a table storing mapping information between words and vocabulary words. 
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13. (Original) The system as set forth in claim 1 wherein the system further 
includes a table storing mapping information between words and examples of parts of speech. 



14. (Original) The system as set forth in claim 1 wherein the system further 
includes tables of punctuation. 

15. (Original) The system as set forth in claim 1 wherein the system includes 
specific pronunciation files. 

16. (Currently Amended) A system comprising: 

a first module configured to convert input text to audible speech in a selected 
language, the audible speech indicative of a model; 

a second module synchronized to the first module, the second module producing an 
animated image of a human face and head pronouncing the audible speech; 

a user interface positioned to receive utterances spoken by a user in response to a 
prompt to replicate the audible speech; and, 

a third module configured to recognize the utterances and provide feedback to the 
user, the feedback being comprised of at least one of a score, an icon and an audio segment 
reflecting a precision at which the user replicates the speech in the selected language based 
on a comparison of the utterances to one of the audible speech and the model , wherein the 
feedback is provided for replication of at least one of paragraphs, sentences, words and sub- 
words . 

17. (Currently Amended) A method for voice interactive language instruction 
comprising: 
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converting input text data to audible speech data; 

generating audible speech comprising phonemes based on the audible speech data; 

outputting the audible speech through an audio output device; 

generating an animated image of a face and head pronouncing the audible speech; 

synchronizing the audible speech and the video image; 

prompting a user to replicate the audible speech; 

recognizing utterances generated by the user in response to the prompting; 
comparing the audible speech to the utterances; and, 

providing feedback to the user based on the comparison, the feedback comprised of at 
least one of a score, an icon and an audio segment reflecting a precision at which the user 
replicates the audible speech , wherein the feedback is provided for replication of at least one 
of paragraphs, sentences, words and sub-words . 

18. (Original) The method as set forth in claim 17 further comprising 
receiving the input text from one of a network, a stored lesson file, a scanner, and the 
internet. 

19. (Original) The method as set forth in claim 17 wherein the feedback 
comprises providing a playback of selected portions of the audible speech and utterances. 

20. (Currently Added) The system as set forth in claim 1 wherein the confidence 
measure comprises at least one of a numeric score, an icon, and an audio segment. 
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21. (New) A system comprising: 

a first module configured to convert input text to audible speech in a selected 
language, the audible speech indicative of a model; 

a second module synchronized to the first module, the second module producing an 
animated image of a human face and head pronouncing the audible speech; 

a user interface positioned to receive utterances spoken by a user in response to a 
prompt to replicate the audible speech; 

a third module configured to recognize the utterances and provide feedback to the 
user, the feedback being comprised of at least one of a score, an icon and an audio segment 
reflecting a precision at which the user replicates the speech in the selected language based 
on a comparison of the utterances to one of the audible speech and the model; and 

a record and playback module. 

22. (New) A method for voice interactive language instruction comprising: 
converting input text data to audible speech data; 

generating audible speech comprising phonemes based on the audible speech data; 

outputting the audible speech through an audio output device; 

generating an animated image of a face and head pronouncing the audible speech; 

synchronizing the audible speech and the video image; 

prompting a user to replicate the audible speech; 

recognizing utterances generated by the user in response to the prompting; 
comparing the audible speech to the utterances; and, 

providing feedback to the user based on the comparison, the feedback being 
comprised of at least one of a score, an icon and an audio segment reflecting a precision at 
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which the user replicates the audible speech and the feedback being further comprised of a 
playback of selected portions of the audible speech and utterances. 

23. (New) A system for interactive language instruction comprising: 
means for converting input text audible speech in a selected language, the audible 

speech being patterned after a model; 

means for receiving utterances spoken by a user in response to a prompt to replicate 
the audible speech; 

means for recognizing the utterances and providing feedback to the user on each sub- 
word or phoneme portion of the utterances, the feedback being comprised of a confidence 
measure reflecting a precision at which the user replicates the audible speech in the selected 
language based on the comparison of the utterances to one of the audible speech and the 
model, wherein the confidence measure is provided as scores for replication of paragraphs, 
sentences, words, sub-words or phonemes. 

24. (New) The system as set forth in claim 23 wherein the feedback further 
comprises audio playback. 
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